Statistical Syllabification of English Phoneme Sequences using Supervised and Unsupervised Algorithms
نویسنده
چکیده
A syllable is defined as a unit of spoken language bigger than a speech sound (phoneme), and is made up of three components: a nucleus, which consists of a single vowel or syllabic consonant, optionally surrounded by one or more consonants. The consonants that precede the nucleus are collectively referred to as the onset, while those that succeed it are called the coda. The nucleus and coda are sometimes lumped together to form what is called the rhyme.
منابع مشابه
Unsupervised Text Syllabification with Information from Audio
The objectives of this internship were: studies on supervised and unsupervised text syllabification techniques, with a focus on unsupervised methods, implementation and evaluation of several unsupervised audio syllabification algorithms, implementation and evaluation of an unsupervised text syllabification algorithms, improve the performance of text syllabification with features from acoustics....
متن کاملMorphological Segmentation Can Improve Syllabification
Syllabification is sometimes influenced by morphological boundaries. We show that incorporating morphological information can improve the accuracy of orthographic syllabification in English and German. Surprisingly, unsupervised segmenters, such as Morfessor, can be more useful for this purpose than the supervised ones.
متن کاملData-driven approaches for automatic detection of syllable boundaries
Syllabification is an essential component of many speech and language processing systems. The development of automatic speech recognizers frequently requires working with subword units such as syllables. More importantly, syllabification is an inevitable part of speech synthesis system. In this paper we present data-driven approaches to supervised learning and automatic detection of syllable bo...
متن کاملPredicting Pronunciations with Syllabification and Stress with Recurrent Neural Networks
Word pronunciations, consisting of phoneme sequences and the associated syllabification and stress patterns, are vital for both speech recognition and text-to-speech (TTS) systems. For speech recognition phoneme sequences for words may be learned from audio data. We train recurrent neural network (RNN) based models to predict the syllabification and stress pattern for such pronunciations making...
متن کاملUse of Trajectory M Automatic Accent Cl
This paper describes a proposed automatic language accent identification system based on phoneme class trajectory models. Our focus is to preserve discriminant information of the spectral evolution that belong to each accent. Here, we describe two classification schemes based on stochastic trajectory models; supervised and unsupervised classification. For supervised classification, we assume te...
متن کامل